10 research outputs found
Intelligent high-altitude power inspection vision module based on KendryteK210 microcontroller
Now our country has a huge electric power system, it needs a complex electric power transmission network to support
its normal operation. With the development of unmanned aerial vehicle platform and microprocessor technology in recent years, the
unmanned aerial vehicle inspection platform based on microprocessor is an important development direction of power transmission network
maintenance. Based on this background, this paper designs an intelligent high-altitude line inspection vision module based on KendryteK210
microcontroller. The module can be used as a UAV load to carry out efficient power line patrol work, and wireless communication is
carried out by ESP8285Wi-Fi. First of all, the inspection vision module uses OV2640 visible light camera to complete the target image
data acquisition. Then, in the process of data processing, the least square method and Theil-Sen regression algorithm are combined to get
the target line object, so as to get the slope and length of the line object and other parameters. Finally, the target in the image was identifi ed
based on the yolov2 neural network model, and then the fl ight path instruction was provided for the UAV platform
Progressive Text-to-Image Diffusion with Soft Latent Direction
In spite of the rapidly evolving landscape of text-to-image generation, the
synthesis and manipulation of multiple entities while adhering to specific
relational constraints pose enduring challenges. This paper introduces an
innovative progressive synthesis and editing operation that systematically
incorporates entities into the target image, ensuring their adherence to
spatial and relational constraints at each sequential step. Our key insight
stems from the observation that while a pre-trained text-to-image diffusion
model adeptly handles one or two entities, it often falters when dealing with a
greater number. To address this limitation, we propose harnessing the
capabilities of a Large Language Model (LLM) to decompose intricate and
protracted text descriptions into coherent directives adhering to stringent
formats. To facilitate the execution of directives involving distinct semantic
operations-namely insertion, editing, and erasing-we formulate the Stimulus,
Response, and Fusion (SRF) framework. Within this framework, latent regions are
gently stimulated in alignment with each operation, followed by the fusion of
the responsive latent components to achieve cohesive entity manipulation. Our
proposed framework yields notable advancements in object synthesis,
particularly when confronted with intricate and lengthy textual inputs.
Consequently, it establishes a new benchmark for text-to-image generation
tasks, further elevating the field's performance standards.Comment: 14 pages, 15 figure
Dynamic Feature Pruning and Consolidation for Occluded Person Re-Identification
Occluded person re-identification (ReID) is a challenging problem due to
contamination from occluders, and existing approaches address the issue with
prior knowledge cues, eg human body key points, semantic segmentations and etc,
which easily fails in the presents of heavy occlusion and other humans as
occluders. In this paper, we propose a feature pruning and consolidation (FPC)
framework to circumvent explicit human structure parse, which mainly consists
of a sparse encoder, a global and local feature ranking module, and a feature
consolidation decoder. Specifically, the sparse encoder drops less important
image tokens (mostly related to background noise and occluders) solely
according to correlation within the class token attention instead of relying on
prior human shape information. Subsequently, the ranking stage relies on the
preserved tokens produced by the sparse encoder to identify k-nearest neighbors
from a pre-trained gallery memory by measuring the image and patch-level
combined similarity. Finally, we use the feature consolidation module to
compensate pruned features using identified neighbors for recovering essential
information while disregarding disturbance from noise and occlusion.
Experimental results demonstrate the effectiveness of our proposed framework on
occluded, partial and holistic Re-ID datasets. In particular, our method
outperforms state-of-the-art results by at least 8.6% mAP and 6.0% Rank-1
accuracy on the challenging Occluded-Duke dataset.Comment: 12 pages, 9 figure
Fine-grained Appearance Transfer with Diffusion Models
Image-to-image translation (I2I), and particularly its subfield of appearance
transfer, which seeks to alter the visual appearance between images while
maintaining structural coherence, presents formidable challenges. Despite
significant advancements brought by diffusion models, achieving fine-grained
transfer remains complex, particularly in terms of retaining detailed
structural elements and ensuring information fidelity. This paper proposes an
innovative framework designed to surmount these challenges by integrating
various aspects of semantic matching, appearance transfer, and latent
deviation. A pivotal aspect of our approach is the strategic use of the
predicted space by diffusion models within the latent space of diffusion
processes. This is identified as a crucial element for the precise and natural
transfer of fine-grained details. Our framework exploits this space to
accomplish semantic alignment between source and target images, facilitating
mask-wise appearance transfer for improved feature acquisition. A significant
advancement of our method is the seamless integration of these features into
the latent space, enabling more nuanced latent deviations without necessitating
extensive model retraining or fine-tuning. The effectiveness of our approach is
demonstrated through extensive experiments, which showcase its ability to
adeptly handle fine-grained appearance transfers across a wide range of
categories and domains. We provide our code at
https://github.com/babahui/Fine-grained-Appearance-TransferComment: 14 pages, 15 figure
CT-based radiomics for predicting radio-chemotherapy response and overall survival in nonsurgical esophageal carcinoma
BackgroundTo predict treatment response and 2 years overall survival (OS) of radio-chemotherapy in patients with esophageal cancer (EC) by radiomics based on the computed tomography (CT) images.MethodsThis study retrospectively collected 171 nonsurgical EC patients treated with radio-chemotherapy from Jan 2010 to Jan 2019. 80 patients were randomly divided into training (n=64) and validation (n=16) cohorts to predict the radiochemotherapy response. The models predicting treatment response were established by Lasso and logistic regression. A total of 156 patients were allocated into the training cohort (n=110), validation cohort (n=23) and test set (n=23) to predict 2-year OS. The Lasso Cox model and Cox proportional hazards model established the models predicting 2-year OS.ResultsTo predict the radiochemotherapy response, WFK as a radiomics feature, and clinical stages and clinical M stages (cM) as clinical features were selected to construct the clinical-radiomics model, achieving 0.78 and 0.75 AUC (area under the curve) in the training and validation sets, respectively. Furthermore, radiomics features called WFI and WGI combined with clinical features (smoking index, pathological types, cM) were the optimal predictors to predict 2-year OS. The AUC values of the clinical-radiomics model were 0.71 and 0.70 in the training set and validation set, respectively.ConclusionsThis study demonstrated that planning CT-based radiomics showed the predictability of the radiochemotherapy response and 2-year OS in nonsurgical esophageal carcinoma. The predictive results prior to treatment have the potential to assist physicians in choosing the optimal therapeutic strategy to prolong overall survival
Robust estimation of bacterial cell count from optical density
Optical density (OD) is widely used to estimate the density of cells in liquid culture, but cannot be compared between instruments without a standardized calibration protocol and is challenging to relate to actual cell count. We address this with an interlaboratory study comparing three simple, low-cost, and highly accessible OD calibration protocols across 244 laboratories, applied to eight strains of constitutive GFP-expressing E. coli. Based on our results, we recommend calibrating OD to estimated cell count using serial dilution of silica microspheres, which produces highly precise calibration (95.5% of residuals <1.2-fold), is easily assessed for quality control, also assesses instrument effective linear range, and can be combined with fluorescence calibration to obtain units of Molecules of Equivalent Fluorescein (MEFL) per cell, allowing direct comparison and data fusion with flow cytometry measurements: in our study, fluorescence per cell measurements showed only a 1.07-fold mean difference between plate reader and flow cytometry data
Recommended from our members
Computer Vision System-On-Chip Designs for Intelligent Vehicles
Intelligent vehicle technologies are growing rapidly that can enhance road safety, improve transport efficiency, and aid driver operations through sensors and intelligence. Advanced driver assistance system (ADAS) is a common platform of intelligent vehicle technologies. Many sensors like LiDAR, radar, cameras have been deployed on intelligent vehicles. Among these sensors, optical cameras are most widely used due to their low costs and easy installation. However, most computer vision algorithms are complicated and computationally slow, making them difficult to be deployed on power constraint systems. This dissertation investigates several mainstream ADAS applications, and proposes corresponding efficient digital circuits implementations for these applications. This dissertation presents three ways of software / hardware algorithm division for three ADAS applications: lane detection, traffic sign classification, and traffic light detection. Using FPGA to offload critical parts of the algorithm, the entire computer vision system is able to run in real time while maintaining a low power consumption and a high detection rate. Catching up with the advent of deep learning in the field of computer vision, we also present two deep learning based hardware implementations on application specific integrated circuits (ASIC) to achieve even lower power consumption and higher accuracy.
The real time lane detection system is implemented on Xilinx Zynq platform, which has a dual core ARM processor and FPGA fabric. The Xilinx Zynq platform integrates the software programmability of an ARM processor with the hardware programmability of an FPGA. For the lane detection task, the FPGA handles the majority of the task: region-of-interest extraction, edge detection, image binarization, and hough transform. After then, the ARM processor takes in hough transform results and highlights lanes using the hough peaks algorithm. The entire system is able to process 1080P video stream at a constant speed of 69.4 frames per second, realizing real time capability.
An efficient system-on-chip (SOC) design which classifies up to 48 traffic signs in real time is presented in this dissertation. The traditional histogram of oriented gradients (HoG) and support vector machine (SVM) are proven to be very effective on traffic sign classification with an average accuracy rate of 93.77%. For traffic sign classification, the biggest challenge comes from the low execution efficiency of the HoG on embedded processors. By dividing the HoG algorithm into three fully pipelined stages, as well as leveraging extra on-chip memory to store intermediate results, we successfully achieved a throughput of 115.7 frames per second at 1080P resolution. The proposed generic HoG hardware implementation could also be used as an individual IP core by other computer vision systems.
A real time traffic signal detection system is implemented to present an efficient hardware implementation of the traditional grass-fire blob detection. The traditional grass-fire blob detection method iterates the input image multiple times to calculate connected blobs. In digital circuits, five extra on-chip block memories are utilized to save intermediate results. By using additional memories, all connected blob information could be obtained through one-pass image traverse. The proposed hardware friendly blob detection can run at 72.4 frames per second with 1080P video input. Applying HoG + SVM as feature extractor and classifier, 92.11% recall rate and 99.29% precision rate are obtained on red lights, and 94.44% recall rate and 98.27% precision rate on green lights.
Nowadays, convolutional neural network (CNN) is revolutionizing computer vision due to learnable layer by layer feature extraction. However, when coming into inference, CNNs are usually slow to train and slow to execute. In this dissertation, we studied the implementation of principal component analysis based network (PCANet), which strikes a balance between algorithm robustness and computational complexity. Compared to a regular CNN, the PCANet only needs one iteration training, and typically at most has a few tens convolutions on a single layer. Compared to hand-crafted features extraction methods, the PCANet algorithm well reflects the variance in the training dataset and can better adapt to difficult conditions. The PCANet algorithm achieves accuracy rates of 96.8% and 93.1% on road marking detection and traffic light detection, respectively. Implementing in Synopsys 32nm process technology, the proposed chip can classify 724,743 32-by-32 image candidates in one second, with only 0.5 watt power consumption.
In this dissertation, binary neural network (BNN) is adopted as a potential detector for intelligent vehicles. The BNN constrains all activations and weights to be +1 or -1. Compared to a CNN with the same network configuration, the BNN achieves 50 times better resource usage with only 1% - 2% accuracy loss. Taking car detection and pedestrian detection as examples, the BNN achieves an average accuracy rate of over 95%. Furthermore, a BNN accelerator implemented in Synopsys 32nm process technology is presented in our work. The elastic architecture of the BNN accelerator makes it able to process any number of convolutional layers with high throughput. The BNN accelerator only consumes 0.6 watt and doesn't rely on external memory for storage. </P
WWP2 Regulates Kidney Fibrosis and the Metabolic Reprogramming of Profibrotic Myofibroblasts
Background: Renal fibrosis is a common pathological endpoint in CKD that is challenging to reverse, and myofibroblasts are responsible for the accumulation of a fibrillar collagen-rich extracellular matrix (ECM). Recent studies have unveiled myofibroblasts diversity in terms of proliferative and fibrotic characteristics, which are linked to different metabolic states. We previously demonstrated the regulation of ECM genes and tissue fibrosis by WWP2, a multifunctional E3 ubiquitin-protein ligase. Here, we investigate WWP2 in renal fibrosis and in the metabolic reprograming of myofibroblasts in CKD.
Methods: We used kidney samples from CKD patients and WWP2 -null kidney disease mice models, and leveraged single cell RNA-seq analysis to detail the cell-specific regulation of WWP2 in fibrotic kidneys. Experiments in primary cultured myofibroblasts by bulk-RNA seq, ChIP-seq, metabolomics and cellular metabolism assays, were used to study the metabolic regulation of WWP2 and its downstream signaling.
Results: The tubulointerstitial expression of WWP2 was associated with fibrotic progression in CKD patients and in murine kidney disease models. WWP2 deficiency promoted myofibroblast proliferation and halts pro-fibrotic activation, reducing the severity of kidney fibrosis in vivo . In renal myofibroblasts, WWP2 deficiency increased fatty acid oxidation and activated the pentose phosphate pathway, boosting mitochondrial respiration at the expense of glycolysis. WWP2 suppressed the transcription of PGC-1α, a metabolic mediator of fibrotic response, and pharmacological inhibition of PGC-1α partially abrogated the protective effects of WWP2 deficiency on myofibroblasts.
Conclusions: WWP2 regulates the metabolic reprogramming of profibrotic myofibroblasts by a WWP2-PGC-1α axis, and WWP2 deficiency protects against kidney fibrosis in CKD